Overview
Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 220450 |
| Missing cells | 239094 |
| Missing cells (%) | 3.4% |
| Duplicate rows | 15636 |
| Duplicate rows (%) | 7.1% |
| Total size in memory | 55.5 MiB |
| Average record size in memory | 264.0 B |
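The percentages and the average record size in the table above follow from the raw counts. A minimal Python sketch of that arithmetic, assuming missing cells are measured against all cells (observations × variables) and that 1 MiB is 1024 × 1024 bytes; the variable names are illustrative:

```python
# Raw counts copied from the dataset statistics table.
n_variables = 32
n_observations = 220450
missing_cells = 239094
duplicate_rows = 15636
total_size_mib = 55.5

# Assumption: missing cells are expressed as a share of all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Duplicate rows as a share of all observations.
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"{missing_pct:.1f}% missing, {duplicate_pct:.1f}% duplicates, "
      f"{avg_record_bytes:.1f} B per record")
```

Under these assumptions the computed figures agree with the reported ones after rounding to one decimal place.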
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 14 |
| Text | 1 |
| Unsupported | 1 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 15636 (7.1%) duplicate rows | Duplicates |
| children is highly imbalanced (81.9%) | Imbalance |
| babies is highly imbalanced (97.0%) | Imbalance |
| meal is highly imbalanced (54.1%) | Imbalance |
| distribution_channel is highly imbalanced (62.8%) | Imbalance |
| is_repeated_guest is highly imbalanced (80.0%) | Imbalance |
| reserved_room_type is highly imbalanced (59.8%) | Imbalance |
| assigned_room_type is highly imbalanced (51.6%) | Imbalance |
| deposit_type is highly imbalanced (63.8%) | Imbalance |
| required_car_parking_spaces is highly imbalanced (85.2%) | Imbalance |
| agent has 30206 (13.7%) missing values | Missing |
| company has 207847 (94.3%) missing values | Missing |
| adults is highly skewed (γ1 = 23.85907871) | Skewed |
| previous_cancellations is highly skewed (γ1 = 20.25122199) | Skewed |
| previous_bookings_not_canceled is highly skewed (γ1 = 24.34646594) | Skewed |
| reservation_status_date is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| lead_time has 12397 (5.6%) zeros | Zeros |
| stays_in_weekend_nights has 97084 (44.0%) zeros | Zeros |
| stays_in_week_nights has 14064 (6.4%) zeros | Zeros |
| previous_cancellations has 203331 (92.2%) zeros | Zeros |
| previous_bookings_not_canceled has 214322 (97.2%) zeros | Zeros |
| booking_changes has 187752 (85.2%) zeros | Zeros |
| days_in_waiting_list has 212514 (96.4%) zeros | Zeros |
| adr has 4053 (1.8%) zeros | Zeros |
| total_of_special_requests has 134849 (61.2%) zeros | Zeros |
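The imbalance percentages in the alerts appear consistent with one minus the normalized Shannon entropy of the class frequencies. The sketch below is an assumption about how the score can be derived, not a copy of the ydata-profiling source, and the function name `imbalance_score` is hypothetical. It uses the class counts that this report lists for is_repeated_guest and babies.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class frequencies and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts taken from the Common Values tables in this report.
is_repeated_guest = [213582, 6868]
babies = [218658, 1760, 27, 3, 2]

print(f"is_repeated_guest: {imbalance_score(is_repeated_guest):.1%}")
print(f"babies: {imbalance_score(babies):.1%}")
```

A perfectly balanced variable scores 0 under this definition, and a variable dominated by a single class approaches 1.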
Reproduction
| Analysis started | 2025-04-10 08:56:51.258721 |
|---|---|
| Analysis finished | 2025-04-10 08:57:33.692279 |
| Duration | 42.43 seconds |
| Software version | ydata-profiling v4.16.1 |
Variables
hotel
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 10.686369 |
| Min length | 10 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | City Hotel |
|---|---|
| 2nd row | City Hotel |
| 3rd row | City Hotel |
| 4th row | Resort Hotel |
| 5th row | Resort Hotel |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| City Hotel | 144795 | 65.7% |
| Resort Hotel | 75655 | 34.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| hotel | 220450 | 50.0% |
| city | 144795 | 32.8% |
| resort | 75655 | 17.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| t | 440900 | 18.7% |
| o | 296105 | 12.6% |
| e | 296105 | 12.6% |
| (space) | 220450 | 9.4% |
| H | 220450 | 9.4% |
| l | 220450 | 9.4% |
| C | 144795 | 6.1% |
| i | 144795 | 6.1% |
| y | 144795 | 6.1% |
| R | 75655 | 3.2% |
| Other values (2) | 151310 | 6.4% |
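The character counts above can be reproduced from the two category values and their row counts. A minimal sketch using only the Python standard library; `value_counts` is an illustrative name, and the counts are the ones listed under Common Values:

```python
from collections import Counter

# Row counts per category, as listed under Common Values.
value_counts = {"City Hotel": 144795, "Resort Hotel": 75655}

# Weight each character by the number of rows that hold its value.
char_counts = Counter()
for value, rows in value_counts.items():
    for ch in value:
        char_counts[ch] += rows

total_chars = sum(char_counts.values())
mean_length = total_chars / sum(value_counts.values())

for ch, count in char_counts.most_common(5):
    print(repr(ch), count, f"{100 * count / total_chars:.1f}%")
print("total characters:", total_chars)
print("mean length:", round(mean_length, 6))
```

The same total also explains the mean length reported for this variable, since it is the total character count divided by the number of rows.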
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 139205 | 63.1% |
| 1 | 81245 | 36.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 139205 | 63.1% |
| 1 | 81245 | 36.9% |
lead_time
Real number (ℝ)
Zeros 
| Distinct | 479 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.28062 |
| Minimum | 0 |
| Maximum | 737 |
| Zeros | 12397 |
| Zeros (%) | 5.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 17 |
| median | 66 |
| Q3 | 158 |
| 95-th percentile | 320 |
| Maximum | 737 |
| Range | 737 |
| Interquartile range (IQR) | 141 |
Descriptive statistics
| Standard deviation | 106.38601 |
|---|---|
| Coefficient of variation (CV) | 1.0401385 |
| Kurtosis | 1.381721 |
| Mean | 102.28062 |
| Median Absolute Deviation (MAD) | 58 |
| Skewness | 1.3054921 |
| Sum | 22547762 |
| Variance | 11317.983 |
| Monotonicity | Not monotonic |
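Several of the lead_time statistics above are derived from one another. The sketch below checks those relationships using the reported (rounded) figures, so small discrepancies in the last digits are expected; the variable names are illustrative.

```python
# Reported lead_time statistics (rounded as shown in the report).
n = 220450            # number of observations
total = 22547762      # Sum
std = 106.38601       # Standard deviation
q1, q3 = 17, 158      # Q1 and Q3
minimum, maximum = 0, 737

mean = total / n                  # Mean = Sum / number of observations
cv = std / mean                   # Coefficient of variation = std / mean
variance = std ** 2               # Variance = std squared
iqr = q3 - q1                     # Interquartile range = Q3 - Q1
value_range = maximum - minimum   # Range = Maximum - Minimum

print(round(mean, 5), round(cv, 7), round(variance, 3), iqr, value_range)
```

A coefficient of variation above 1 means the standard deviation exceeds the mean, which fits the long right tail suggested by the gap between the median (66) and the maximum (737).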
| Value | Count | Frequency (%) |
| 0 | 12397 | 5.6% |
| 1 | 6607 | 3.0% |
| 2 | 3878 | 1.8% |
| 3 | 3498 | 1.6% |
| 4 | 3220 | 1.5% |
| 5 | 3052 | 1.4% |
| 6 | 2744 | 1.2% |
| 7 | 2428 | 1.1% |
| 8 | 2166 | 1.0% |
| 12 | 2143 | 1.0% |
| Other values (469) | 178317 |
| Value | Count | Frequency (%) |
| 0 | 12397 | |
| 1 | 6607 | |
| 2 | 3878 | 1.8% |
| 3 | 3498 | 1.6% |
| 4 | 3220 | 1.5% |
| 5 | 3052 | 1.4% |
| 6 | 2744 | 1.2% |
| 7 | 2428 | 1.1% |
| 8 | 2166 | 1.0% |
| 9 | 1840 | 0.8% |
| Value | Count | Frequency (%) |
| 737 | 3 | < 0.1% |
| 709 | 2 | < 0.1% |
| 629 | 17 | < 0.1% |
| 626 | 60 | |
| 622 | 17 | < 0.1% |
| 615 | 17 | < 0.1% |
| 608 | 17 | < 0.1% |
| 605 | 60 | |
| 601 | 17 | < 0.1% |
| 594 | 17 | < 0.1% |
arrival_date_year
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2016 |
| 3rd row | 2016 |
| 4th row | 2016 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2019 | 79264 | 36.0% |
| 2016 | 56609 | 25.7% |
| 2017 | 40612 | 18.4% |
| 2018 | 21996 | 10.0% |
| 2015 | 21969 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 220450 | 25.0% |
| 0 | 220450 | 25.0% |
| 1 | 220450 | 25.0% |
| 9 | 79264 | 9.0% |
| 6 | 56609 | 6.4% |
| 7 | 40612 | 4.6% |
| 8 | 21996 | 2.5% |
| 5 | 21969 | 2.5% |
arrival_date_month
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.1932502 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | September |
|---|---|
| 2nd row | September |
| 3rd row | March |
| 4th row | April |
| 5th row | August |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| October | 27271 | 12.4% |
| August | 26703 | 12.1% |
| September | 26125 | 11.9% |
| July | 22775 | 10.3% |
| May | 17242 | 7.8% |
| December | 16582 | 7.5% |
| April | 16498 | 7.5% |
| June | 16211 | 7.4% |
| November | 15924 | 7.2% |
| March | 14615 | 6.6% |
| Other values (2) | 20504 | 9.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| october | 27271 | 12.4% |
| august | 26703 | 12.1% |
| september | 26125 | 11.9% |
| july | 22775 | 10.3% |
| may | 17242 | 7.8% |
| december | 16582 | 7.5% |
| april | 16498 | 7.5% |
| june | 16211 | 7.4% |
| november | 15924 | 7.2% |
| march | 14615 | 6.6% |
| Other values (2) | 20504 | 9.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 215702 | 15.8% |
| r | 149770 | 11.0% |
| u | 112896 | 8.3% |
| b | 98153 | 7.2% |
| t | 80099 | 5.9% |
| a | 60614 | 4.4% |
| y | 60521 | 4.4% |
| m | 58631 | 4.3% |
| c | 58468 | 4.3% |
| J | 47239 | 3.5% |
| Other values (16) | 423209 | 31.0% |
arrival_date_week_number
Real number (ℝ)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.961061 |
| Minimum | 1 |
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 19 |
| median | 32 |
| Q3 | 41 |
| 95-th percentile | 50 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 13.562111 |
|---|---|
| Coefficient of variation (CV) | 0.45265789 |
| Kurtosis | -0.92573271 |
| Mean | 29.961061 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.27303218 |
| Sum | 6604916 |
| Variance | 183.93085 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 7097 | 3.2% |
| 41 | 6797 | 3.1% |
| 42 | 6729 | 3.1% |
| 38 | 6693 | 3.0% |
| 39 | 6464 | 2.9% |
| 32 | 5842 | 2.7% |
| 40 | 5822 | 2.6% |
| 34 | 5791 | 2.6% |
| 30 | 5728 | 2.6% |
| 43 | 5635 | 2.6% |
| Other values (43) | 157852 |
| Value | Count | Frequency (%) |
| 1 | 1286 | 0.6% |
| 2 | 1648 | |
| 3 | 1777 | |
| 4 | 2068 | |
| 5 | 1969 | |
| 6 | 2255 | |
| 7 | 3194 | |
| 8 | 3234 | |
| 9 | 3184 | |
| 10 | 3195 |
| Value | Count | Frequency (%) |
| 53 | 4451 | |
| 52 | 2900 | |
| 51 | 2181 | |
| 50 | 3617 | |
| 49 | 4411 | |
| 48 | 3620 | |
| 47 | 4051 | |
| 46 | 3569 | |
| 45 | 4428 | |
| 44 | 5411 |
arrival_date_day_of_month
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.78763 |
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.7597191 |
|---|---|
| Coefficient of variation (CV) | 0.55484701 |
| Kurtosis | -1.1902355 |
| Mean | 15.78763 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.0015869541 |
| Sum | 3480383 |
| Variance | 76.73268 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28 | 10017 | 4.5% |
| 5 | 8470 | 3.8% |
| 17 | 8358 | 3.8% |
| 12 | 7692 | 3.5% |
| 16 | 7662 | 3.5% |
| 18 | 7658 | 3.5% |
| 25 | 7624 | 3.5% |
| 26 | 7579 | 3.4% |
| 15 | 7512 | 3.4% |
| 9 | 7511 | 3.4% |
| Other values (21) | 140367 |
| Value | Count | Frequency (%) |
| 1 | 6484 | |
| 2 | 7270 | |
| 3 | 7075 | |
| 4 | 7013 | |
| 5 | 8470 | |
| 6 | 7025 | |
| 7 | 6887 | |
| 8 | 7325 | |
| 9 | 7511 | |
| 10 | 6653 |
| Value | Count | Frequency (%) |
| 31 | 4169 | |
| 30 | 7445 | |
| 29 | 3574 | 1.6% |
| 28 | 10017 | |
| 27 | 6938 | |
| 26 | 7579 | |
| 25 | 7624 | |
| 24 | 7375 | |
| 23 | 6671 | |
| 22 | 6522 |
stays_in_weekend_nights
Real number (ℝ)
Zeros 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.91920163 |
| Minimum | 0 |
| Maximum | 19 |
| Zeros | 97084 |
| Zeros (%) | 44.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.99688745 |
|---|---|
| Coefficient of variation (CV) | 1.0845144 |
| Kurtosis | 7.2428249 |
| Mean | 0.91920163 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3831452 |
| Sum | 202638 |
| Variance | 0.99378459 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 97084 | |
| 2 | 60901 | |
| 1 | 56108 | |
| 4 | 3375 | 1.5% |
| 3 | 2364 | 1.1% |
| 6 | 238 | 0.1% |
| 5 | 164 | 0.1% |
| 8 | 108 | < 0.1% |
| 7 | 48 | < 0.1% |
| 9 | 25 | < 0.1% |
| Other values (7) | 35 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 97084 | |
| 1 | 56108 | |
| 2 | 60901 | |
| 3 | 2364 | 1.1% |
| 4 | 3375 | 1.5% |
| 5 | 164 | 0.1% |
| 6 | 238 | 0.1% |
| 7 | 48 | < 0.1% |
| 8 | 108 | < 0.1% |
| 9 | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 19 | 2 | < 0.1% |
| 18 | 3 | < 0.1% |
| 16 | 4 | < 0.1% |
| 14 | 4 | < 0.1% |
| 13 | 5 | < 0.1% |
| 12 | 8 | < 0.1% |
| 10 | 9 | < 0.1% |
| 9 | 25 | < 0.1% |
| 8 | 108 | |
| 7 | 48 |
stays_in_week_nights
Real number (ℝ)
Zeros 
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4786029 |
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 14064 |
| Zeros (%) | 6.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8963995 |
|---|---|
| Coefficient of variation (CV) | 0.76510824 |
| Kurtosis | 24.352696 |
| Mean | 2.4786029 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.8462936 |
| Sum | 546408 |
| Variance | 3.596331 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 63893 | |
| 1 | 56810 | |
| 3 | 39485 | |
| 5 | 19988 | 9.1% |
| 4 | 17176 | 7.8% |
| 0 | 14064 | 6.4% |
| 6 | 2840 | 1.3% |
| 10 | 1949 | 0.9% |
| 7 | 1912 | 0.9% |
| 8 | 1208 | 0.5% |
| Other values (25) | 1125 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 14064 | 6.4% |
| 1 | 56810 | |
| 2 | 63893 | |
| 3 | 39485 | |
| 4 | 17176 | 7.8% |
| 5 | 19988 | 9.1% |
| 6 | 2840 | 1.3% |
| 7 | 1912 | 0.9% |
| 8 | 1208 | 0.5% |
| 9 | 420 | 0.2% |
| Value | Count | Frequency (%) |
| 50 | 2 | < 0.1% |
| 42 | 3 | < 0.1% |
| 41 | 2 | < 0.1% |
| 40 | 2 | < 0.1% |
| 35 | 2 | < 0.1% |
| 34 | 2 | < 0.1% |
| 33 | 3 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 8 | |
| 26 | 1 | < 0.1% |
adults
Real number (ℝ)
Skewed 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8499887 |
| Minimum | 0 |
| Maximum | 55 |
| Zeros | 719 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.62476855 |
|---|---|
| Coefficient of variation (CV) | 0.3377148 |
| Kurtosis | 1623.8275 |
| Mean | 1.8499887 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.859079 |
| Sum | 407830 |
| Variance | 0.39033575 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 166178 | |
| 1 | 43168 | 19.6% |
| 3 | 10227 | 4.6% |
| 0 | 719 | 0.3% |
| 4 | 110 | < 0.1% |
| 26 | 15 | < 0.1% |
| 27 | 6 | < 0.1% |
| 20 | 6 | < 0.1% |
| 5 | 6 | < 0.1% |
| 40 | 3 | < 0.1% |
| Other values (4) | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 719 | 0.3% |
| 1 | 43168 | 19.6% |
| 2 | 166178 | |
| 3 | 10227 | 4.6% |
| 4 | 110 | < 0.1% |
| 5 | 6 | < 0.1% |
| 6 | 3 | < 0.1% |
| 10 | 3 | < 0.1% |
| 20 | 6 | < 0.1% |
| 26 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 55 | 3 | < 0.1% |
| 50 | 3 | < 0.1% |
| 40 | 3 | < 0.1% |
| 27 | 6 | < 0.1% |
| 26 | 15 | < 0.1% |
| 20 | 6 | < 0.1% |
| 10 | 3 | < 0.1% |
| 6 | 3 | < 0.1% |
| 5 | 6 | < 0.1% |
| 4 | 110 |
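Because adults takes only 14 distinct values, its mean and skewness can be recomputed from a frequency table assembled from the value tables above. This is a sketch under one stated assumption: it uses the population (biased) moment formula, which with 220450 observations should differ only negligibly from a bias-corrected sample estimator. The name `freq` is illustrative.

```python
# Value -> count for adults, assembled from the tables above (14 distinct values).
freq = {0: 719, 1: 43168, 2: 166178, 3: 10227, 4: 110, 5: 6, 6: 3,
        10: 3, 20: 6, 26: 15, 27: 6, 40: 3, 50: 3, 55: 3}

n = sum(freq.values())
mean = sum(v * c for v, c in freq.items()) / n

# Second and third central moments (population form, dividing by n).
m2 = sum(c * (v - mean) ** 2 for v, c in freq.items()) / n
m3 = sum(c * (v - mean) ** 3 for v, c in freq.items()) / n

# Assumption: skewness taken as m3 / m2^(3/2), without bias correction.
skewness = m3 / m2 ** 1.5

print(n, round(mean, 7), round(skewness, 3))
```

The handful of bookings with 20 or more adults contributes almost all of the third moment, which is why the skewness is so large even though 95% of rows have at most 2 adults.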
children
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Memory size | 3.4 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0000181 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 205877 | 93.4% |
| 1.0 | 8214 | 3.7% |
| 2.0 | 6223 | 2.8% |
| 3.0 | 120 | 0.1% |
| 10.0 | 4 | < 0.1% |
| (Missing) | 12 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 426319 | 64.5% |
| . | 220438 | 33.3% |
| 1 | 8218 | 1.2% |
| 2 | 6223 | 0.9% |
| 3 | 120 | < 0.1% |
babies
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000091 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 218658 | 99.2% |
| 1 | 1760 | 0.8% |
| 2 | 27 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 218660 | 99.2% |
| 1 | 1762 | 0.8% |
| 2 | 27 | < 0.1% |
| 9 | 3 | < 0.1% |
meal
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 2 |
| Mean length | 2.0675709 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BB |
|---|---|
| 2nd row | SC |
| 3rd row | SC |
| 4th row | BB |
| 5th row | BB |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| BB | 171514 | 77.8% |
| HB | 28359 | 12.9% |
| SC | 16520 | 7.5% |
| Undefined | 2128 | 1.0% |
| FB | 1929 | 0.9% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| bb | 171514 | 77.8% |
| hb | 28359 | 12.9% |
| sc | 16520 | 7.5% |
| undefined | 2128 | 1.0% |
| fb | 1929 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| B | 373316 | 81.9% |
| H | 28359 | 6.2% |
| S | 16520 | 3.6% |
| C | 16520 | 3.6% |
| n | 4256 | 0.9% |
| d | 4256 | 0.9% |
| e | 4256 | 0.9% |
| U | 2128 | 0.5% |
| f | 2128 | 0.5% |
| i | 2128 | 0.5% |
country
Text
| Distinct | 177 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1029 |
| Missing (%) | 0.5% |
| Memory size | 3.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.9906572 |
| Min length | 2 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | BEL |
|---|---|
| 2nd row | DEU |
| 3rd row | ESP |
| 4th row | PRT |
| 5th row | PRT |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| prt | 97933 | 44.6% |
| gbr | 20412 | 9.3% |
| fra | 18290 | 8.3% |
| esp | 16456 | 7.5% |
| deu | 12188 | 5.6% |
| ita | 6768 | 3.1% |
| irl | 5829 | 2.7% |
| bel | 3893 | 1.8% |
| nld | 3602 | 1.6% |
| bra | 3583 | 1.6% |
| Other values (167) | 30467 | 13.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| R | 153017 | 23.3% |
| P | 116732 | 17.8% |
| T | 107988 | 16.5% |
| E | 38218 | 5.8% |
| A | 37318 | 5.7% |
| B | 28534 | 4.3% |
| S | 25487 | 3.9% |
| U | 22198 | 3.4% |
| G | 22165 | 3.4% |
| F | 19201 | 2.9% |
| Other values (16) | 85355 | 13.0% |
market_segment
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.054602 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Online TA |
|---|---|
| 2nd row | Online TA |
| 3rd row | Online TA |
| 4th row | Direct |
| 5th row | Direct |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Online TA | 96581 | 43.8% |
| Offline TA/TO | 48882 | 22.2% |
| Groups | 40038 | 18.2% |
| Direct | 22925 | 10.4% |
| Corporate | 10216 | 4.6% |
| Complementary | 1440 | 0.7% |
| Aviation | 362 | 0.2% |
| Undefined | 6 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| online | 96581 | 26.4% |
| ta | 96581 | 26.4% |
| offline | 48882 | 13.4% |
| ta/to | 48882 | 13.4% |
| groups | 40038 | 10.9% |
| direct | 22925 | 6.3% |
| corporate | 10216 | 2.8% |
| complementary | 1440 | 0.4% |
| aviation | 362 | 0.1% |
| undefined | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| n | 243858 | 12.2% |
| O | 194345 | 9.7% |
| T | 194345 | 9.7% |
| e | 181496 | 9.1% |
| i | 169118 | 8.5% |
| l | 146903 | 7.4% |
| A | 145825 | 7.3% |
| (space) | 145463 | 7.3% |
| f | 97770 | 4.9% |
| r | 84835 | 4.3% |
| Other values (16) | 392129 | 19.6% |
distribution_channel
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.354797 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TA/TO |
|---|---|
| 2nd row | TA/TO |
| 3rd row | TA/TO |
| 4th row | Direct |
| 5th row | Direct |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| TA/TO | 180131 | 81.7% |
| Direct | 27089 | 12.3% |
| Corporate | 12916 | 5.9% |
| GDS | 299 | 0.1% |
| Undefined | 15 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| ta/to | 180131 | 81.7% |
| direct | 27089 | 12.3% |
| corporate | 12916 | 5.9% |
| gds | 299 | 0.1% |
| undefined | 15 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| T | 360262 | 30.5% |
| / | 180131 | 15.3% |
| O | 180131 | 15.3% |
| A | 180131 | 15.3% |
| r | 52921 | 4.5% |
| e | 40035 | 3.4% |
| t | 40005 | 3.4% |
| D | 27388 | 2.3% |
| i | 27104 | 2.3% |
| c | 27089 | 2.3% |
| Other values (10) | 65268 | 5.5% |
is_repeated_guest
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 213582 | 96.9% |
| 1 | 6868 | 3.1% |
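The report's alert list gives an imbalance of 80.0% for is_repeated_guest, and that figure can be reproduced from the two counts above. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalized by the logarithm of the number of classes. That definition is inferred from the reported values rather than stated in the report, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy of the class counts (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    # Entropy in nats; the log base cancels out after normalization by log(k).
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Counts for the values 0 and 1 of is_repeated_guest, from the Common Values table.
score = imbalance_score([213582, 6868])
print(f"{score:.1%}")
```

Under the same assumption, the deposit_type counts (191328, 28822, 300) give roughly 63.8%, which also matches the alert list.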
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 213582 | 96.9% |
| 1 | 6868 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 213582 | 96.9% |
| 1 | 6868 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
previous_cancellations
Real number (ℝ)
Skewed  Zeros 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.12571105 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 203331 |
| Zeros (%) | 92.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.0493607 |
|---|---|
| Coefficient of variation (CV) | 8.3474028 |
| Kurtosis | 451.78693 |
| Mean | 0.12571105 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.251222 |
| Sum | 27713 |
| Variance | 1.101158 |
| Monotonicity | Not monotonic |
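Several entries in the statistics above are derived from one another, which gives a quick sanity check on the report. The sketch below assumes that the mean is Sum divided by the number of observations (220450, from the report overview), that CV is the standard deviation over the mean, and that variance is the square of the standard deviation. The variable names are illustrative.

```python
import math

# Values copied from the previous_cancellations tables and the report overview.
n_observations = 220450
total_sum = 27713
reported_mean = 0.12571105
reported_std = 1.0493607
reported_cv = 8.3474028
reported_variance = 1.101158

mean = total_sum / n_observations  # mean = Sum / n
cv = reported_std / reported_mean  # coefficient of variation = std / mean
variance = reported_std ** 2       # variance = std squared

# Compare with a loose tolerance because the report rounds its figures.
checks = {
    "mean": math.isclose(mean, reported_mean, rel_tol=1e-5),
    "cv": math.isclose(cv, reported_cv, rel_tol=1e-5),
    "variance": math.isclose(variance, reported_variance, rel_tol=1e-5),
}
print(checks)
```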
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 203331 | 92.2% |
| 1 | 16162 | 7.3% |
| 2 | 226 | 0.1% |
| 24 | 144 | 0.1% |
| 3 | 132 | 0.1% |
| 26 | 78 | < 0.1% |
| 25 | 75 | < 0.1% |
| 11 | 72 | < 0.1% |
| 19 | 57 | < 0.1% |
| 4 | 43 | < 0.1% |
| Other values (5) | 130 | 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 203331 | 92.2% |
| 1 | 16162 | 7.3% |
| 2 | 226 | 0.1% |
| 3 | 132 | 0.1% |
| 4 | 43 | < 0.1% |
| 5 | 32 | < 0.1% |
| 6 | 29 | < 0.1% |
| 11 | 72 | < 0.1% |
| 13 | 24 | < 0.1% |
| 14 | 42 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 26 | 78 | < 0.1% |
| 25 | 75 | < 0.1% |
| 24 | 144 | 0.1% |
| 21 | 3 | < 0.1% |
| 19 | 57 | < 0.1% |
| 14 | 42 | < 0.1% |
| 13 | 24 | < 0.1% |
| 11 | 72 | < 0.1% |
| 6 | 29 | < 0.1% |
| 5 | 32 | < 0.1% |
previous_bookings_not_canceled
Real number (ℝ)
Skewed  Zeros 
| Distinct | 73 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.12007258 |
| Minimum | 0 |
|---|---|
| Maximum | 72 |
| Zeros | 214322 |
| Zeros (%) | 97.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.3664408 |
|---|---|
| Coefficient of variation (CV) | 11.380124 |
| Kurtosis | 817.78214 |
| Mean | 0.12007258 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.346466 |
| Sum | 26470 |
| Variance | 1.8671606 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 214322 | 97.2% |
| 1 | 2671 | 1.2% |
| 2 | 992 | 0.4% |
| 3 | 572 | 0.3% |
| 4 | 389 | 0.2% |
| 5 | 313 | 0.1% |
| 6 | 191 | 0.1% |
| 7 | 143 | 0.1% |
| 8 | 114 | 0.1% |
| 9 | 93 | < 0.1% |
| Other values (63) | 650 | 0.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 214322 | 97.2% |
| 1 | 2671 | 1.2% |
| 2 | 992 | 0.4% |
| 3 | 572 | 0.3% |
| 4 | 389 | 0.2% |
| 5 | 313 | 0.1% |
| 6 | 191 | 0.1% |
| 7 | 143 | 0.1% |
| 8 | 114 | 0.1% |
| 9 | 93 | < 0.1% |
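Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 72 | 1 | < 0.1% |
| 71 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 69 | 1 | < 0.1% |
| 68 | 1 | < 0.1% |
| 67 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |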
reserved_room_type
Categorical
Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| A | 162415 |
|---|---|
| D | 33035 |
| E | 11350 |
| F | 5071 |
| G | 3625 |
| Other values (5) | 4954 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | D |
| 5th row | D |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| A | 162415 | 73.7% |
| D | 33035 | 15.0% |
| E | 11350 | 5.1% |
| F | 5071 | 2.3% |
| G | 3625 | 1.6% |
| B | 2281 | 1.0% |
| C | 1558 | 0.7% |
| H | 1078 | 0.5% |
| P | 19 | < 0.1% |
| L | 18 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| a | 162415 | 73.7% |
| d | 33035 | 15.0% |
| e | 11350 | 5.1% |
| f | 5071 | 2.3% |
| g | 3625 | 1.6% |
| b | 2281 | 1.0% |
| c | 1558 | 0.7% |
| h | 1078 | 0.5% |
| p | 19 | < 0.1% |
| l | 18 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| A | 162415 | 73.7% |
| D | 33035 | 15.0% |
| E | 11350 | 5.1% |
| F | 5071 | 2.3% |
| G | 3625 | 1.6% |
| B | 2281 | 1.0% |
| C | 1558 | 0.7% |
| H | 1078 | 0.5% |
| P | 19 | < 0.1% |
| L | 18 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
assigned_room_type
Categorical
Imbalance 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| A | 137866 |
|---|---|
| D | 45811 |
| E | 14070 |
| F | 6821 |
| G | 4543 |
| Other values (7) | 11339 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | D |
| 5th row | D |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| A | 137866 | 62.5% |
| D | 45811 | 20.8% |
| E | 14070 | 6.4% |
| F | 6821 | 3.1% |
| G | 4543 | 2.1% |
| B | 4540 | 2.1% |
| C | 4321 | 2.0% |
| H | 1310 | 0.6% |
| I | 668 | 0.3% |
| K | 478 | 0.2% |
| Other values (2) | 22 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| a | 137866 | 62.5% |
| d | 45811 | 20.8% |
| e | 14070 | 6.4% |
| f | 6821 | 3.1% |
| g | 4543 | 2.1% |
| b | 4540 | 2.1% |
| c | 4321 | 2.0% |
| h | 1310 | 0.6% |
| i | 668 | 0.3% |
| k | 478 | 0.2% |
| Other values (2) | 22 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| A | 137866 | 62.5% |
| D | 45811 | 20.8% |
| E | 14070 | 6.4% |
| F | 6821 | 3.1% |
| G | 4543 | 2.1% |
| B | 4540 | 2.1% |
| C | 4321 | 2.0% |
| H | 1310 | 0.6% |
| I | 668 | 0.3% |
| K | 478 | 0.2% |
| Other values (2) | 22 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
booking_changes
Real number (ℝ)
Zeros 
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.21308233 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 187752 |
| Zeros (%) | 85.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.63584996 |
|---|---|
| Coefficient of variation (CV) | 2.9840576 |
| Kurtosis | 88.245898 |
| Mean | 0.21308233 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.2374415 |
| Sum | 46974 |
| Variance | 0.40430517 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 187752 | 85.2% |
| 1 | 23462 | 10.6% |
| 2 | 6557 | 3.0% |
| 3 | 1595 | 0.7% |
| 4 | 626 | 0.3% |
| 5 | 205 | 0.1% |
| 6 | 103 | < 0.1% |
| 7 | 55 | < 0.1% |
| 8 | 29 | < 0.1% |
| 9 | 16 | < 0.1% |
| Other values (11) | 50 | < 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 187752 | 85.2% |
| 1 | 23462 | 10.6% |
| 2 | 6557 | 3.0% |
| 3 | 1595 | 0.7% |
| 4 | 626 | 0.3% |
| 5 | 205 | 0.1% |
| 6 | 103 | < 0.1% |
| 7 | 55 | < 0.1% |
| 8 | 29 | < 0.1% |
| 9 | 16 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 21 | 2 | < 0.1% |
| 20 | 3 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 5 | < 0.1% |
| 16 | 3 | < 0.1% |
| 15 | 5 | < 0.1% |
| 14 | 7 | < 0.1% |
| 13 | 9 | < 0.1% |
| 12 | 4 | < 0.1% |
| 11 | 4 | < 0.1% |
deposit_type
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| No Deposit | 191328 |
|---|---|
| Non Refund | 28822 |
| Refundable | 300 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Deposit |
|---|---|
| 2nd row | No Deposit |
| 3rd row | No Deposit |
| 4th row | No Deposit |
| 5th row | No Deposit |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| No Deposit | 191328 | 86.8% |
| Non Refund | 28822 | 13.1% |
| Refundable | 300 | 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| no | 191328 | 43.4% |
| deposit | 191328 | 43.4% |
| non | 28822 | 6.5% |
| refund | 28822 | 6.5% |
| refundable | 300 | 0.1% |
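The lowercased table above appears to count whitespace-separated words rather than whole category values, which is why `no` and `deposit` share one count and `non` shows 6.5% instead of the 13.1% reported for `Non Refund`. A minimal sketch under that assumption follows; the tokenization rule is inferred from the numbers, and `word_counts` is a hypothetical helper.

```python
from collections import Counter


def word_counts(value_counts):
    """Count lowercased, whitespace-separated words, weighted by value frequency."""
    words = Counter()
    for value, count in value_counts.items():
        for word in value.lower().split():
            words[word] += count
    return words


# Category counts from the deposit_type Common Values table.
deposit_type = {"No Deposit": 191328, "Non Refund": 28822, "Refundable": 300}
words = word_counts(deposit_type)
total_words = sum(words.values())

# Frequencies are shares of all words, not of all rows.
for word, count in words.most_common():
    print(word, count, f"{count / total_words:.1%}")
```

Because each two-word value contributes two words, the denominator is the total word count (440600 here) rather than the 220450 rows.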
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| o | 411478 | 18.7% |
| e | 220750 | 10.0% |
| N | 220150 | 10.0% |
| (space) | 220150 | 10.0% |
| s | 191328 | 8.7% |
| i | 191328 | 8.7% |
| t | 191328 | 8.7% |
| p | 191328 | 8.7% |
| D | 191328 | 8.7% |
| n | 57944 | 2.6% |
| Other values (7) | 117388 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2204500 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2204500 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2204500 | 100.0% |
agent
Real number (ℝ)
Missing 
| Distinct | 333 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 30206 |
| Missing (%) | 13.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.16739 |
| Minimum | 1 |
|---|---|
| Maximum | 535 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 14 |
| Q3 | 201 |
| 95-th percentile | 250 |
| Maximum | 535 |
| Range | 534 |
| Interquartile range (IQR) | 192 |
Descriptive statistics
| Standard deviation | 107.98789 |
|---|---|
| Coefficient of variation (CV) | 1.2830134 |
| Kurtosis | -0.21732489 |
| Mean | 84.16739 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 1.0575014 |
| Sum | 16012341 |
| Variance | 11661.384 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 52933 | 24.0% |
| 240 | 25658 | 11.6% |
| 1 | 18250 | 8.3% |
| 6 | 7044 | 3.2% |
| 7 | 5998 | 2.7% |
| 14 | 5990 | 2.7% |
| 250 | 5040 | 2.3% |
| 28 | 3105 | 1.4% |
| 241 | 3050 | 1.4% |
| 3 | 2893 | 1.3% |
| Other values (323) | 60283 | 27.3% |
| (Missing) | 30206 | 13.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 18250 | 8.3% |
| 2 | 346 | 0.2% |
| 3 | 2893 | 1.3% |
| 4 | 133 | 0.1% |
| 5 | 760 | 0.3% |
| 6 | 7044 | 3.2% |
| 7 | 5998 | 2.7% |
| 8 | 2788 | 1.3% |
| 9 | 52933 | 24.0% |
| 10 | 495 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 535 | 3 | < 0.1% |
| 531 | 68 | < 0.1% |
| 527 | 35 | < 0.1% |
| 526 | 8 | < 0.1% |
| 510 | 2 | < 0.1% |
| 509 | 10 | < 0.1% |
| 508 | 6 | < 0.1% |
| 502 | 24 | < 0.1% |
| 497 | 1 | < 0.1% |
| 495 | 57 | < 0.1% |
company
Real number (ℝ)
Missing 
| Distinct | 352 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 207847 |
| Missing (%) | 94.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 176.09244 |
| Minimum | 6 |
|---|---|
| Maximum | 543 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 47 |
| median | 169 |
| Q3 | 238 |
| 95-th percentile | 405 |
| Maximum | 543 |
| Range | 537 |
| Interquartile range (IQR) | 191 |
Descriptive statistics
| Standard deviation | 124.11385 |
|---|---|
| Coefficient of variation (CV) | 0.70482216 |
| Kurtosis | -0.44328574 |
| Mean | 176.09244 |
| Median Absolute Deviation (MAD) | 106 |
| Skewness | 0.61107357 |
| Sum | 2219293 |
| Variance | 15404.248 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 40 | 1905 | 0.9% |
| 223 | 1534 | 0.7% |
| 45 | 495 | 0.2% |
| 67 | 435 | 0.2% |
| 281 | 399 | 0.2% |
| 153 | 320 | 0.1% |
| 174 | 294 | 0.1% |
| 154 | 242 | 0.1% |
| 233 | 227 | 0.1% |
| 219 | 217 | 0.1% |
| Other values (342) | 6535 | 3.0% |
| (Missing) | 207847 | 94.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 2 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 85 | < 0.1% |
| 10 | 2 | < 0.1% |
| 11 | 3 | < 0.1% |
| 12 | 27 | < 0.1% |
| 14 | 13 | < 0.1% |
| 16 | 11 | < 0.1% |
| 18 | 2 | < 0.1% |
| 20 | 109 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 543 | 2 | < 0.1% |
| 541 | 1 | < 0.1% |
| 539 | 2 | < 0.1% |
| 534 | 2 | < 0.1% |
| 531 | 1 | < 0.1% |
| 530 | 5 | < 0.1% |
| 528 | 2 | < 0.1% |
| 525 | 15 | < 0.1% |
| 523 | 19 | < 0.1% |
| 521 | 7 | < 0.1% |
days_in_waiting_list
Real number (ℝ)
Zeros 
| Distinct | 128 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6934543 |
| Minimum | 0 |
|---|---|
| Maximum | 391 |
| Zeros | 212514 |
| Zeros (%) | 96.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 391 |
| Range | 391 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 18.627174 |
|---|---|
| Coefficient of variation (CV) | 6.9157195 |
| Kurtosis | 160.05919 |
| Mean | 2.6934543 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.971006 |
| Sum | 593772 |
| Variance | 346.97163 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 212514 | 96.4% |
| 58 | 492 | 0.2% |
| 39 | 451 | 0.2% |
| 44 | 278 | 0.1% |
| 31 | 253 | 0.1% |
| 87 | 240 | 0.1% |
| 69 | 214 | 0.1% |
| 35 | 191 | 0.1% |
| 50 | 186 | 0.1% |
| 46 | 183 | 0.1% |
| Other values (118) | 5448 | 2.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 212514 | 96.4% |
| 1 | 20 | < 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 118 | 0.1% |
| 4 | 45 | < 0.1% |
| 5 | 10 | < 0.1% |
| 6 | 38 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 29 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 391 | 90 | < 0.1% |
| 379 | 30 | < 0.1% |
| 330 | 30 | < 0.1% |
| 259 | 20 | < 0.1% |
| 236 | 69 | < 0.1% |
| 224 | 20 | < 0.1% |
| 223 | 119 | 0.1% |
| 215 | 42 | < 0.1% |
| 207 | 30 | < 0.1% |
| 193 | 2 | < 0.1% |
customer_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| Transient | 157149 |
|---|---|
| Transient-Party | 52010 |
| Contract | 10160 |
| Group | 1131 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 10.34895 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transient |
|---|---|
| 2nd row | Transient |
| 3rd row | Transient |
| 4th row | Transient |
| 5th row | Transient |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Transient | 157149 | 71.3% |
| Transient-Party | 52010 | 23.6% |
| Contract | 10160 | 4.6% |
| Group | 1131 | 0.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| transient | 157149 | 71.3% |
| transient-party | 52010 | 23.6% |
| contract | 10160 | 4.6% |
| group | 1131 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| n | 428478 | 18.8% |
| t | 281489 | 12.3% |
| r | 272460 | 11.9% |
| a | 271329 | 11.9% |
| T | 209159 | 9.2% |
| s | 209159 | 9.2% |
| i | 209159 | 9.2% |
| e | 209159 | 9.2% |
| y | 52010 | 2.3% |
| - | 52010 | 2.3% |
| Other values (7) | 87014 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2281426 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2281426 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2281426 | 100.0% |
adr
Real number (ℝ)
Zeros 
| Distinct | 8876 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 97.911112 |
| Minimum | -6.38 |
|---|---|
| Maximum | 5400 |
| Zeros | 4053 |
| Zeros (%) | 1.8% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | -6.38 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 65 |
| median | 90 |
| Q3 | 120 |
| 95-th percentile | 186 |
| Maximum | 5400 |
| Range | 5406.38 |
| Interquartile range (IQR) | 55 |
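The quantiles above are enough to flag extreme adr values without the raw data. The sketch below applies Tukey's 1.5 × IQR fences, a common rule of thumb that the report itself does not use. `tukey_fences` is a hypothetical helper, and the candidate values are the minimum, the 95th percentile, and the two largest values listed for adr in this section.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Q1 and Q3 for adr, from the quantile statistics table.
lower, upper = tukey_fences(65, 120)
print(lower, upper)  # -17.5 202.5

# Minimum, 95th percentile, and the two largest values listed for adr.
for value in (-6.38, 186, 510, 5400):
    flagged = value < lower or value > upper
    print(value, "outlier" if flagged else "within fences")
```

By this rule the single negative value (-6.38) stays inside the lower fence, so it would need a separate domain check (a negative daily rate) rather than a statistical one.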
Descriptive statistics
| Standard deviation | 49.23318 |
|---|---|
| Coefficient of variation (CV) | 0.50283547 |
| Kurtosis | 1221.5127 |
| Mean | 97.911112 |
| Median Absolute Deviation (MAD) | 26.85 |
| Skewness | 12.188218 |
| Sum | 21584505 |
| Variance | 2423.906 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 62 | 9618 | 4.4% |
| 75 | 5164 | 2.3% |
| 65 | 4858 | 2.2% |
| 90 | 4483 | 2.0% |
| 0 | 4053 | 1.8% |
| 80 | 3242 | 1.5% |
| 100 | 2856 | 1.3% |
| 60 | 2844 | 1.3% |
| 95 | 2719 | 1.2% |
| 85 | 2678 | 1.2% |
| Other values (8866) | 177935 | 80.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -6.38 | 1 | < 0.1% |
| 0 | 4053 | 1.8% |
| 0.26 | 1 | < 0.1% |
| 0.5 | 2 | < 0.1% |
| 1 | 29 | < 0.1% |
| 1.29 | 2 | < 0.1% |
| 1.48 | 2 | < 0.1% |
| 1.56 | 3 | < 0.1% |
| 1.6 | 3 | < 0.1% |
| 1.8 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5400 | 2 | < 0.1% |
| 510 | 1 | < 0.1% |
| 508 | 3 | < 0.1% |
| 451.5 | 2 | < 0.1% |
| 450 | 1 | < 0.1% |
| 437 | 1 | < 0.1% |
| 426.25 | 1 | < 0.1% |
| 402 | 1 | < 0.1% |
| 397.38 | 1 | < 0.1% |
| 392 | 2 | < 0.1% |
required_car_parking_spaces
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| 0 | 206492 |
|---|---|
| 1 | 13906 |
| 2 | 46 |
| 3 | 4 |
| 8 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 206492 | 93.7% |
| 1 | 13906 | 6.3% |
| 2 | 46 | < 0.1% |
| 3 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 206492 | 93.7% |
| 1 | 13906 | 6.3% |
| 2 | 46 | < 0.1% |
| 3 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 206492 | 93.7% |
| 1 | 13906 | 6.3% |
| 2 | 46 | < 0.1% |
| 3 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 220450 | 100.0% |
total_of_special_requests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.53800862 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 134849 |
| Zeros (%) | 61.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.77661042 |
|---|---|
| Coefficient of variation (CV) | 1.4434907 |
| Kurtosis | 1.6078064 |
| Mean | 0.53800862 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.405831 |
| Sum | 118604 |
| Variance | 0.60312374 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 134849 | 61.2% |
| 1 | 57986 | 26.3% |
| 2 | 22856 | 10.4% |
| 3 | 4192 | 1.9% |
| 4 | 505 | 0.2% |
| 5 | 62 | < 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 134849 | 61.2% |
| 1 | 57986 | 26.3% |
| 2 | 22856 | 10.4% |
| 3 | 4192 | 1.9% |
| 4 | 505 | 0.2% |
| 5 | 62 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 62 | < 0.1% |
| 4 | 505 | 0.2% |
| 3 | 4192 | 1.9% |
| 2 | 22856 | 10.4% |
| 1 | 57986 | 26.3% |
| 0 | 134849 | 61.2% |
reservation_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| Check-Out | 139295 |
|---|---|
| Canceled | 78829 |
| No-Show | 2326 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.6213155 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Check-Out |
|---|---|
| 2nd row | Check-Out |
| 3rd row | Check-Out |
| 4th row | Canceled |
| 5th row | Check-Out |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Check-Out | 139295 | 63.2% |
| Canceled | 78829 | 35.8% |
| No-Show | 2326 | 1.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| check-out | 139295 | 63.2% |
| canceled | 78829 | 35.8% |
| no-show | 2326 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 296953 | 15.6% |
| C | 218124 | 11.5% |
| c | 218124 | 11.5% |
| h | 141621 | 7.5% |
| - | 141621 | 7.5% |
| u | 139295 | 7.3% |
| t | 139295 | 7.3% |
| O | 139295 | 7.3% |
| k | 139295 | 7.3% |
| a | 78829 | 4.1% |
| Other values (7) | 248117 | 13.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1900569 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1900569 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1900569 | 100.0% |
reservation_status_date
Unsupported
Rejected  Unsupported 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
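One possible reason for the unsupported type is that reservation_status_date mixes two string layouts: the sample rows of this report show both `2015-09-05` and `2019-01-20 00:00:00`. The sketch below normalizes both layouts to a date using only the standard library. The cause is an assumption based on those sample rows, and `parse_status_date` is a hypothetical helper.

```python
from datetime import date, datetime

# The two layouts seen in the report's sample rows.
_FORMATS = ("%Y-%m-%d", "%Y-%m-%d %H:%M:%S")


def parse_status_date(text: str) -> date:
    """Parse either layout into a date; raise ValueError if neither matches."""
    cleaned = text.strip()
    for fmt in _FORMATS:
        try:
            return datetime.strptime(cleaned, fmt).date()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date layout: {text!r}")


print(parse_status_date("2015-09-05"))           # 2015-09-05
print(parse_status_date("2019-01-20 00:00:00"))  # 2019-01-20
```

Raising on unknown layouts, rather than silently returning a placeholder, keeps any third format visible before the column is profiled again.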
Interactions
Missing values
Sample
| | hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | City Hotel | 0 | 21 | 2015 | September | 36 | 1 | 0 | 4 | 2 | 0.0 | 0 | BB | BEL | Online TA | TA/TO | 0 | 0 | 0 | A | A | 2 | No Deposit | 9.0 | NaN | 0 | Transient | 105.0 | 0 | 0 | Check-Out | 2015-09-05 |
| 1 | City Hotel | 0 | 20 | 2016 | September | 38 | 12 | 1 | 0 | 1 | 0.0 | 0 | SC | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 89.0 | 0 | 2 | Check-Out | 2016-09-13 |
| 2 | City Hotel | 0 | 2 | 2016 | March | 13 | 24 | 0 | 2 | 2 | 0.0 | 0 | SC | ESP | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 134.0 | 0 | 1 | Check-Out | 2016-03-26 |
| 3 | Resort Hotel | 1 | 6 | 2016 | April | 17 | 21 | 0 | 1 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | D | D | 0 | No Deposit | NaN | NaN | 0 | Transient | 73.0 | 0 | 0 | Canceled | 2016-04-18 |
| 4 | Resort Hotel | 0 | 40 | 2015 | August | 34 | 20 | 2 | 3 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | D | D | 0 | No Deposit | 250.0 | NaN | 0 | Transient | 176.8 | 1 | 1 | Check-Out | 2015-08-25 |
| 5 | City Hotel | 0 | 256 | 2017 | July | 29 | 21 | 1 | 2 | 2 | 0.0 | 0 | BB | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient-Party | 107.1 | 0 | 2 | Check-Out | 2017-07-24 |
| 6 | City Hotel | 1 | 77 | 2015 | July | 29 | 13 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 76.5 | 0 | 1 | Canceled | 2015-06-29 |
| 7 | City Hotel | 0 | 1 | 2016 | August | 32 | 4 | 0 | 1 | 2 | 0.0 | 0 | BB | BEL | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 151.0 | 0 | 1 | Check-Out | 2016-08-05 |
| 8 | City Hotel | 0 | 150 | 2017 | April | 14 | 2 | 2 | 2 | 2 | 1.0 | 0 | BB | FRA | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 135.0 | 0 | 2 | Check-Out | 2017-04-06 |
| 9 | Resort Hotel | 0 | 90 | 2017 | June | 26 | 28 | 2 | 5 | 2 | 0.0 | 0 | BB | IRL | Direct | Direct | 0 | 0 | 0 | A | A | 0 | No Deposit | NaN | NaN | 0 | Transient | 127.0 | 0 | 0 | Check-Out | 2017-07-05 |
| | hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 79254 | Resort Hotel | 1 | 45 | 2019 | March | 10 | 1 | 0 | 5 | 2 | 0.0 | 0 | HB | PRT | Groups | Direct | 0 | 0 | 0 | E | E | 0 | No Deposit | NaN | NaN | 0 | Transient-Party | 81.00 | 0 | 0 | Check-Out | 2019-01-20 00:00:00 |
| 79255 | Resort Hotel | 1 | 45 | 2019 | March | 10 | 1 | 0 | 5 | 2 | 0.0 | 0 | HB | PRT | Groups | Direct | 0 | 0 | 0 | E | E | 0 | No Deposit | NaN | NaN | 0 | Transient-Party | 81.00 | 0 | 0 | Check-Out | 2019-01-20 00:00:00 |
| 79256 | Resort Hotel | 1 | 33 | 2019 | March | 10 | 1 | 2 | 6 | 2 | 0.0 | 0 | HB | PRT | Groups | Direct | 0 | 0 | 0 | A | A | 0 | No Deposit | NaN | NaN | 0 | Transient-Party | 65.00 | 0 | 0 | Check-Out | 2019-02-25 00:00:00 |
| 79257 | Resort Hotel | 1 | 57 | 2019 | March | 10 | 1 | 2 | 5 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0 | Transient | 48.00 | 0 | 1 | Check-Out | 2019-01-15 00:00:00 |
| 79258 | Resort Hotel | 1 | 33 | 2019 | March | 10 | 1 | 2 | 6 | 1 | 0.0 | 0 | HB | PRT | Groups | Direct | 0 | 0 | 0 | A | A | 0 | No Deposit | NaN | NaN | 0 | Transient-Party | 50.00 | 0 | 0 | Check-Out | 2019-02-25 00:00:00 |
| 79259 | Resort Hotel | 1 | 61 | 2019 | March | 10 | 1 | 4 | 10 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 171.0 | NaN | 0 | Transient | 29.00 | 0 | 0 | Check-Out | 2019-01-06 00:00:00 |
| 79260 | Resort Hotel | 1 | 219 | 2019 | March | 10 | 2 | 2 | 5 | 2 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 310.0 | NaN | 0 | Transient | 52.00 | 0 | 0 | Check-Out | 2018-11-20 00:00:00 |
| 79261 | Resort Hotel | 1 | 219 | 2019 | March | 10 | 2 | 2 | 5 | 2 | 0.0 | 0 | HB | CN | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 310.0 | NaN | 0 | Transient | 52.00 | 0 | 0 | Check-Out | 2018-11-20 00:00:00 |
| 79262 | Resort Hotel | 1 | 219 | 2019 | March | 10 | 2 | 2 | 5 | 2 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 310.0 | NaN | 0 | Transient | 52.00 | 0 | 0 | Check-Out | 2018-11-20 00:00:00 |
| 79263 | Resort Hotel | 1 | 118 | 2019 | March | 10 | 2 | 2 | 5 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 241.0 | NaN | 0 | Transient | 33.26 | 0 | 0 | Check-Out | 2019-01-27 00:00:00 |
Duplicate rows
Most frequently occurring
| | hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10238 | City Hotel | 1 | 277 | 2016 | November | 46 | 7 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | NaN | 0 | Transient | 100.0 | 0 | 0 | Canceled | 180 |
| 10242 | City Hotel | 1 | 277 | 2019 | November | 46 | 7 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | NaN | 0 | Transient | 100.0 | 0 | 0 | Canceled | 180 |
| 8100 | City Hotel | 1 | 68 | 2016 | February | 8 | 17 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 37.0 | NaN | 0 | Transient | 75.0 | 0 | 0 | Canceled | 150 |
| 8113 | City Hotel | 1 | 68 | 2019 | February | 8 | 17 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 37.0 | NaN | 0 | Transient | 75.0 | 0 | 0 | Canceled | 150 |
| 7500 | City Hotel | 1 | 34 | 2018 | December | 50 | 8 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 19.0 | NaN | 0 | Transient | 90.0 | 0 | 0 | Canceled | 140 |
| 7504 | City Hotel | 1 | 34 | 2019 | December | 50 | 8 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 19.0 | NaN | 0 | Transient | 90.0 | 0 | 0 | Canceled | 140 |
| 7487 | City Hotel | 1 | 34 | 2015 | December | 50 | 8 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 19.0 | NaN | 0 | Transient | 90.0 | 0 | 0 | Canceled | 139 |
| 9649 | City Hotel | 1 | 188 | 2019 | June | 25 | 15 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 119.0 | NaN | 39 | Transient | 130.0 | 0 | 0 | Canceled | 109 |
| 9645 | City Hotel | 1 | 188 | 2016 | June | 25 | 15 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 119.0 | NaN | 39 | Transient | 130.0 | 0 | 0 | Canceled | 108 |
| 9333 | City Hotel | 1 | 158 | 2016 | May | 22 | 24 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | NaN | 31 | Transient | 130.0 | 0 | 0 | Canceled | 101 |
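
A table like the one above can be approximated directly in pandas. The sketch below is only an illustration and is not taken from the profiling tool's own code: the function name `most_frequent_duplicates` and its `subset` and `top` parameters are hypothetical, and it assumes the data is already loaded into a DataFrame. It groups on the chosen columns (treating `NaN` as a regular value, since `agent` and `company` are often missing) and counts how often each repeated row occurs.

```python
import pandas as pd


def most_frequent_duplicates(df, subset=None, top=10):
    """Return the `top` most repeated rows with a '# duplicates' column.

    Hypothetical helper for illustration; `subset` selects the columns
    compared for equality (all columns when None).
    """
    cols = list(df.columns) if subset is None else list(subset)
    # Keep only rows that occur more than once on the compared columns.
    repeated = df.loc[df.duplicated(subset=cols, keep=False), cols]
    counts = (
        repeated
        # dropna=False keeps groups whose key contains NaN.
        .groupby(cols, dropna=False, sort=False)
        .size()
        .reset_index(name="# duplicates")
    )
    return counts.sort_values("# duplicates", ascending=False).head(top)
```

Because the duplicates table above has no `reservation_status_date` column, a comparable call would presumably pass `subset` with that column left out; this is an assumption about how the table was produced.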